Abstract

We show that both the reflected backward stochastic differential equation (reflected BSDE for
short) and its associated penalized backward stochastic differential equation (penalized BSDE
for short) admit an optimal stopping representation and an optimal control representation. We
also show that both the multidimensional reflected BSDE and its associated multidimensional
penalized BSDE admit an optimal switching representation. The optimal stopping (switching)
problems corresponding to the penalized BSDEs have the feature that stopping (switching) is
allowed only at exogenous Poisson arrival times.
1 Introduction



El Karoui et al [5] introduced the penalized backward stochastic differential equation (penalized
BSDE for short) to solve the reflected backward stochastic differential equation (reflected BSDE for
short), and they showed that the solution of a reflected BSDE is the value process
of a nonlinear optimal stopping time problem. Our main result is that the
solution of the associated penalized BSDE is also the value process of a nonlinear
optimal stopping time problem, and that the parameter $\lambda$ appearing in the penalized equation is
nothing but the intensity of an exogenous Poisson process.

Let $(W_t)_{t\ge 0}$ be a $d$-dimensional standard Brownian motion defined on a filtered probability
space $(\Omega,\mathcal{F},\{\mathcal{F}_t\}_{t\ge 0},\mathbb{P})$ satisfying the usual conditions, i.e. the filtration $\{\mathcal{F}_t\}_{t\ge 0}$ is right continuous
and complete. In El Karoui et al [5], the authors introduced the following reflected BSDE:

\[ Y_t = \xi + \int_t^T f_s(Y_s,Z_s)\,ds + \int_t^T dK_s - \int_t^T Z_s\,dW_s \tag{1.1} \]

under the constraints:

(Dominating Condition): $Y_t \ge S_t$ for $t\in[0,T]$,

(Skorohod Condition): $\int_0^T (Y_t - S_t)\,dK_t = 0$ for $K$ continuous and increasing,

where the terminal data $\xi$, the driver $f_s(y,z)$, and the obstacle $(S_t)_{0\le t\le T}$ are the given data for the
equation. A solution to reflected BSDE (1.1) is a triplet $(Y,Z,K)$, where $K$ is a kind of "local time"
process. Equation (1.1) corresponds to a backward Skorohod problem, which in turn gives
the "local time" process $K$ a Skorohod representation. See Qian and Xu [18] in this direction.

On the other hand, as shown in [5], reflected BSDE (1.1) also has an interesting interpretation in the sense
that its solution is the value process of a nonlinear optimal stopping time problem: for any fixed
time $t\in[0,T]$, the value process of the following optimal stopping time problem:

\[ y_t = \operatorname*{ess\,sup}_{\tau\in\mathcal{R}(t)} \mathbb{E}\Big[\int_t^{\tau\wedge T} f_s(Y_s,Z_s)\,ds + S_\tau 1_{\{\tau<T\}} + \xi 1_{\{\tau\ge T\}} \,\Big|\, \mathcal{F}_t\Big] \tag{1.2} \]

where the control set $\mathcal{R}(t)$ is defined by

\[ \mathcal{R}(t) = \{\mathcal{F}_s\text{-stopping times } \tau \ge t\}, \]

is given by the solution to reflected BSDE (1.1): $y_t = Y_t$ a.s. for $t\in[0,T]$. The optimal stopping
time is given by $\tau^* = \inf\{s\ge t : Y_s = S_s\}\wedge T$. The nonlinear optimal stopping problem (1.2)
is closely related to pricing and hedging American options, especially in constrained markets, as
shown in El Karoui et al [6].

One way to solve the reflected BSDE is to iterate the solution of the corresponding backward
Skorohod problem by Picard iteration. The other way, which seems more commonly used in the
literature, is to approximate the "local time" process $K$ by

\[ K_t^\lambda = \int_0^t \lambda\max\{0, S_s - Y_s^\lambda\}\,ds \quad\text{for } t\in[0,T], \]

where $(Y^\lambda,Z^\lambda)$ is the solution of the following penalized BSDE:

\[ Y_t^\lambda = \xi + \int_t^T f_s(Y_s^\lambda,Z_s^\lambda)\,ds + \int_t^T \lambda\max\{0, S_s - Y_s^\lambda\}\,ds - \int_t^T Z_s^\lambda\,dW_s. \tag{1.3} \]

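Under Assumption 1.1 introduced below, El Karoui et al [5] proved that $Y^\lambda$ is increasing in $\lambda$,
and

\[ \lim_{\lambda\uparrow\infty}\mathbb{E}\Big[\sup_{t\in[0,T]}|Y_t^\lambda - Y_t|^2 + \int_0^T |Z_t^\lambda - Z_t|^2\,dt + \sup_{t\in[0,T]}|K_t^\lambda - K_t|^2\Big] = 0. \tag{1.4} \]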
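The monotonicity in $\lambda$ and the convergence of the penalized solution to the reflected one can be observed numerically in a toy case. The following is a minimal sketch under assumptions that are not part of the paper: everything is deterministic, the driver is zero, the obstacle is $S_t = \sin(\pi t)$ on $[0,1]$ and $\xi = 0$. In that case the penalized BSDE reduces to the ODE $y' = -\lambda\max\{0, S - y\}$ with $y(T) = \xi$, and the reflected solution reduces to $\max\{\sup_{s\in[t,T)} S_s, \xi\}$. The function names and the implicit Euler discretization are illustrative choices.

```python
import math

def solve_penalized(lam, obstacle, xi, T=1.0, n=2000):
    """Backward implicit Euler for y'(t) = -lam * max(0, S(t) - y(t)), y(T) = xi."""
    h = T / n
    y = [0.0] * (n + 1)
    y[n] = xi
    for k in range(n - 1, -1, -1):
        s_k = obstacle(T * k / n)
        if y[k + 1] >= s_k:
            y[k] = y[k + 1]                      # penalty term is inactive
        else:
            # solves y_k = y_{k+1} + h * lam * (s_k - y_k) for y_k
            y[k] = (y[k + 1] + h * lam * s_k) / (1.0 + h * lam)
    return y

def solve_reflected(obstacle, xi, T=1.0, n=2000):
    """Discrete reflected solution for zero driver: running maximum from the right."""
    y = [0.0] * (n + 1)
    y[n] = xi
    for k in range(n - 1, -1, -1):
        y[k] = max(y[k + 1], obstacle(T * k / n))
    return y

def sup_gap(lam, obstacle, xi):
    """sup over the grid of (reflected solution - penalized solution)."""
    ref = solve_reflected(obstacle, xi)
    pen = solve_penalized(lam, obstacle, xi)
    return max(r - p for r, p in zip(ref, pen))

if __name__ == "__main__":
    S = lambda t: math.sin(math.pi * t)
    for lam in (1.0, 10.0, 100.0, 1000.0):
        print(lam, sup_gap(lam, S, 0.0))
```

In this toy setting the printed gap shrinks as `lam` grows, which is the deterministic analogue of the first term in (1.4).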



Our aim is to give stochastic control representations for penalized BSDE (1.3). Our main result
is that penalized BSDE (1.3) also admits an optimal stopping representation. We impose
the following standard assumption on the data set $(\xi, f, S)$ as in [5]:

Assumption 1.1 • The terminal data $\xi$ is $L^2$-square integrable: $\mathbb{E}[|\xi|^2] < \infty$;

• The driver $f : \Omega\times[0,T]\times\mathbb{R}\times\mathbb{R}^d \to \mathbb{R}$ is uniformly Lipschitz continuous:

\[ |f_t(y,z) - f_t(\bar y,\bar z)| \le C(|y-\bar y| + |z-\bar z|) \quad\text{a.s. for some } C > 0, \]

and $f_t(0,0)$ is $\mathbb{H}^2$-square integrable: $\mathbb{E}\big[\int_0^T |f_t(0,0)|^2\,dt\big] < \infty$;

• The obstacle process $S$ is continuous and uniformly square integrable: $\mathbb{E}\big[\sup_{t\in[0,T]} |S_t|^2\big] < \infty$.

The above conditions can be relaxed. See, for example, Peng and Xu [17] and Lepeltier and Xu
[13] for the extension to RCLL obstacles, and Bayraktar and Song [1] and Lionnet [16], among others,
for the extension to drivers $f_s(y,z)$ with quadratic growth in $z$. However, we stick with the
above standard assumption in this paper. Under it, we have the
following representation, which is the main result of this paper.
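Theorem 1.2 Suppose that Assumption 1.1 holds. Denote $(Y^\lambda,Z^\lambda)$ as the unique solution to penalized
BSDE (1.3). For any fixed time $t\in[0,T]$, define the control set $\mathcal{R}(t,\lambda)$ as

\[ \mathcal{R}(t,\lambda) = \{\mathcal{G}_s\text{-stopping time } \tau \ge t : \tau = T_n \text{ for some } n\ge 1\}, \]

where $(T_n)_{n\ge 0}$ are the arrival times of a Poisson process $(N_s)_{s\ge t}$ with intensity $\lambda$ and minimal augmented
filtration $\{\mathcal{H}_s\}_{s\ge t}$, and $\mathcal{G}_s = \mathcal{F}_s\vee\mathcal{H}_s$. Then the value process of the following optimal stopping time
problem:

\[ y_t^\lambda = \operatorname*{ess\,sup}_{\tau\in\mathcal{R}(t,\lambda)} \mathbb{E}\Big[\int_t^{\tau\wedge T} f_s(Y_s^\lambda,Z_s^\lambda)\,ds + S_\tau 1_{\{\tau<T\}} + \xi 1_{\{\tau\ge T\}} \,\Big|\, \mathcal{F}_t\Big] \tag{1.5} \]

is given by the solution to penalized BSDE (1.3): $y_t^\lambda = Y_t^\lambda$ a.s. for $t\in[0,T]$. The optimal stopping time
is given by $\tau^* = \inf\{T_n > t : Y_{T_n}^\lambda \le S_{T_n}\}\wedge T$.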
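The stopping rule of Theorem 1.2 can be illustrated by simulation. The following is a minimal Monte Carlo sketch under assumptions that are not part of the paper: a deterministic toy setting with zero driver, obstacle $S_t = \sin(\pi t)$ on $[0,1]$, $\xi = 0$, starting time $t = 0$, and a penalized solution computed by an implicit Euler scheme. Poisson arrival times are generated from independent exponential gaps, and the rule stops at the first arrival at which the penalized solution is at or below the obstacle. All function names are hypothetical.

```python
import math
import random

def penalized(lam, S, xi, T=1.0, n=4000):
    """Implicit Euler for y' = -lam * max(0, S - y), y(T) = xi (zero driver)."""
    h = T / n
    y = [0.0] * (n + 1)
    y[n] = xi
    for k in range(n - 1, -1, -1):
        s_k = S(T * k / n)
        if y[k + 1] >= s_k:
            y[k] = y[k + 1]
        else:
            y[k] = (y[k + 1] + h * lam * s_k) / (1.0 + h * lam)
    return y

def poisson_stopping_payoff(lam, S, xi, y, T, rng):
    """Payoff of one path: stop at the first arrival T_n with y(T_n) <= S(T_n)."""
    n = len(y) - 1
    t = 0.0
    while True:
        t += rng.expovariate(lam)       # next Poisson arrival time
        if t >= T:
            return xi                   # no stop before T: receive terminal data
        k = int(t / T * n)              # grid index just below the arrival time
        if y[k] <= S(t):
            return S(t)

def estimate_value(lam=20.0, paths=20000, seed=0):
    """Return (Monte Carlo value of the rule, penalized solution at time 0)."""
    S = lambda t: math.sin(math.pi * t)
    xi, T = 0.0, 1.0
    y = penalized(lam, S, xi, T)
    rng = random.Random(seed)
    total = sum(poisson_stopping_payoff(lam, S, xi, y, T, rng) for _ in range(paths))
    return total / paths, y[0]
```

Calling `estimate_value()` returns the simulated value of the rule next to the grid value of the penalized solution at time 0; in this toy setting the two should agree up to Monte Carlo and discretization error.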

Note that the value process $y^\lambda$ is increasing in $\lambda$, because the larger $\lambda$ is, the more stopping
opportunities we have. By the convergence (1.4) and Theorem 1.2, the two value processes defined
in (1.2) and (1.5) are related by

\[ \lim_{\lambda\uparrow\infty}\mathbb{E}\Big[\sup_{t\in[0,T]} |y_t^\lambda - y_t|^2\Big] = 0. \tag{1.6} \]

The proof of Theorem 1.2 is provided in Section 2. Based on the above representation result
and the idea of randomized stopping by Krylov [12], we also give an optimal control representation
for penalized BSDE, which is formulated and proved in Section 3. In Section 4, we extend
our representation results to the multidimensional case, and give an optimal switching representation
for the multidimensional penalized BSDE associated with the multidimensional reflected (oblique)
BSDE. Finally, Section 5 concludes.



2 Proof of Theorem 1.2



The optimal stopping time problem (1.5) has a constraint on its control set, i.e. the optimal
stopping time must be chosen from the arrival times $(T_n)_{n\ge 1}$ of a Poisson process $N$. Given the
arrival time $T_n$, by defining the pre-$T_n$ $\sigma$-field:

\[ \mathcal{G}_{T_n}(t,\lambda) = \Big\{A\in\bigvee_{s\ge t}\mathcal{G}_s : A\cap\{T_n\le s\}\in\mathcal{G}_s \text{ for } s\ge t\Big\}, \]

it is easy to show that problem (1.5) is equivalent to the following discrete optimal stopping
time problem, where the control constraint does not appear:

\[ y_t^\lambda = \operatorname*{ess\,sup}_{N\in\mathcal{N}(t,\lambda)} \mathbb{E}\Big[\int_t^{T_N\wedge T} f_s(Y_s^\lambda,Z_s^\lambda)\,ds + S_{T_N} 1_{\{T_N<T\}} + \xi 1_{\{T_N\ge T\}} \,\Big|\, \mathcal{F}_t\Big] \tag{2.1} \]

where

\[ \mathcal{N}(t,\lambda) = \{\mathcal{G}_{T_n}(t,\lambda)\text{-stopping time } N : N = n \text{ for some integer } n\ge 1\}. \]

Note that (2.1) is a discrete optimal stopping problem, as we are only allowed to stop at a
sequence of deterministic times, the integers $(n)_{n\ge 1}$. The optimal stopping time is some integer $N^*$
such that $\tau^* = T_{N^*}\wedge T$. If $T_{N^*}\ge T$, we define the optimal stopping time as the terminal time $T$ by
convention.



2.1 Representation for Linear Case 



In this section, we consider the case where the driver $f_s(y,z)$ is independent of $(y,z)$, and
simply write it as $f_s$. Note that the corresponding reflected BSDE becomes
linear, and its optimal stopping time problem (1.2) also becomes linear.

The following lemma is the key to proving Theorem 1.2. Basically, it says that we only need to
consider the optimal stopping time problem (2.1) up to the first Poisson arrival time $T_1$.

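Lemma 2.1 Suppose that Assumption 1.1 holds, and that $f_s(y,z) = f_s$. Then the value process of the
optimal stopping time problem (2.1) satisfies the following dynamic programming equation:

\[ y_t^\lambda = \mathbb{E}\Big[\int_t^{T_1\wedge T} f_s\,ds + \max\{S_{T_1}, y_{T_1}^\lambda\} 1_{\{t<T_1<T\}} + \xi 1_{\{T_1\ge T\}} \,\Big|\, \mathcal{F}_t\Big]. \tag{2.2} \]

Proof. We first introduce an auxiliary optimal stopping time problem:

\[ \hat y_t^\lambda = \operatorname*{ess\,sup}_{N\in\mathcal{N}'(t,\lambda)} \mathbb{E}\Big[\int_t^{T_N\wedge T} f_s\,ds + S_{T_N} 1_{\{T_N<T\}} + \xi 1_{\{T_N\ge T\}} \,\Big|\, \mathcal{F}_t\Big] \tag{2.3} \]

where

\[ \mathcal{N}'(t,\lambda) = \{\mathcal{G}_{T_n}(t,\lambda)\text{-stopping time } N : N = n \text{ for some integer } n\ge 0\}. \]

The difference between (2.3) and (2.1) is that the former is allowed to stop at the starting time
$t$, while the latter is not. Due to the additional stopping opportunity of problem (2.3), we have the
following relationship between the two value processes:

\[ \hat y_t^\lambda = \max\Big\{\operatorname*{ess\,sup}_{N\in\mathcal{N}(t,\lambda)} \mathbb{E}\Big[\int_t^{T_N\wedge T} f_s\,ds + S_{T_N} 1_{\{T_N<T\}} + \xi 1_{\{T_N\ge T\}} \,\Big|\, \mathcal{F}_t\Big],\ S_t\Big\} = \max\{y_t^\lambda, S_t\} \quad\text{for } t\in[0,T]. \tag{2.4} \]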
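Moreover, if $N^*$ is the optimal stopping time for problem (2.1) starting from time $t$, then by time
consistency and (2.4), $N^*$ is also the optimal stopping time for problem (2.3) starting from the
first Poisson arrival time $T_1$:

\[ \hat y_{T_1}^\lambda = \mathbb{E}\Big[\int_{T_1}^{T_{N^*}\wedge T} f_s\,ds + S_{T_{N^*}} 1_{\{T_{N^*}<T\}} + \xi 1_{\{T_{N^*}\ge T\}} \,\Big|\, \mathcal{G}_{T_1}\Big]. \]

If $N^*$ is the optimal stopping time for problem (2.1), by conditioning on the first Poisson arrival
time $T_1$, we have

\[
\begin{aligned}
y_t^\lambda &= \mathbb{E}\Big[\mathbb{E}\Big[\int_t^{T_{N^*}\wedge T} f_s\,ds + S_{T_{N^*}} 1_{\{T_{N^*}<T\}} + \xi 1_{\{T_{N^*}\ge T\}} \,\Big|\, \mathcal{G}_{T_1}\Big] \,\Big|\, \mathcal{F}_t\Big] \\
&= \mathbb{E}\Big[\mathbb{E}\Big[\big(1_{\{t<T_1<T,\ T_1\le T_{N^*}<T\}} + 1_{\{t<T_1<T,\ T_{N^*}\ge T\}} + 1_{\{T_1\ge T\}}\big) \\
&\qquad\qquad \times \Big(\int_t^{T_{N^*}\wedge T} f_s\,ds + S_{T_{N^*}} 1_{\{T_{N^*}<T\}} + \xi 1_{\{T_{N^*}\ge T\}}\Big) \,\Big|\, \mathcal{G}_{T_1}\Big] \,\Big|\, \mathcal{F}_t\Big] \\
&= \mathbb{E}\Big[1_{\{t<T_1<T\}} \mathbb{E}\Big[1_{\{T_1\le T_{N^*}<T\}} \Big(\int_t^{T_1} f_s\,ds + \int_{T_1}^{T_{N^*}} f_s\,ds + S_{T_{N^*}}\Big) \,\Big|\, \mathcal{G}_{T_1}\Big] \\
&\qquad + 1_{\{t<T_1<T\}} \mathbb{E}\Big[1_{\{T_{N^*}\ge T\}} \Big(\int_t^{T_1} f_s\,ds + \int_{T_1}^{T} f_s\,ds + \xi\Big) \,\Big|\, \mathcal{G}_{T_1}\Big] \\
&\qquad + 1_{\{T_1\ge T\}} \mathbb{E}\Big[\int_t^{T} f_s\,ds + \xi \,\Big|\, \mathcal{G}_{T_1}\Big] \,\Big|\, \mathcal{F}_t\Big] \\
&= \mathbb{E}\big[1_{\{t<T_1<T\}} I_1 + 1_{\{t<T_1<T\}} I_2 + 1_{\{T_1\ge T\}} I_3 \,\big|\, \mathcal{F}_t\big],
\end{aligned}
\]

where we used the fact that on $\{T_1\ge T\}$, we must have $\{T_{N^*}\ge T\}$. Moreover,

\[ I_1 = \mathbb{P}(T_1\le T_{N^*}<T \,|\, \mathcal{G}_{T_1}) \int_t^{T_1} f_s\,ds + \mathbb{E}\Big[1_{\{T_1\le T_{N^*}<T\}} \Big(\int_{T_1}^{T_{N^*}} f_s\,ds + S_{T_{N^*}}\Big) \,\Big|\, \mathcal{G}_{T_1}\Big] \]

and

\[ I_2 = \mathbb{P}(T_{N^*}\ge T \,|\, \mathcal{G}_{T_1}) \int_t^{T_1} f_s\,ds + \mathbb{E}\Big[1_{\{T_{N^*}\ge T\}} \Big(\int_{T_1}^{T} f_s\,ds + \xi\Big) \,\Big|\, \mathcal{G}_{T_1}\Big]. \]

By plugging $I_1$ and $I_2$ into the expression for $y_t^\lambda$, we obtain

\[
\begin{aligned}
y_t^\lambda = \mathbb{E}\Big[& 1_{\{t<T_1<T\}} \int_t^{T_1} f_s\,ds \\
&+ 1_{\{t<T_1<T\}} \mathbb{E}\Big[\int_{T_1}^{T_{N^*}\wedge T} f_s\,ds + S_{T_{N^*}} 1_{\{T_1\le T_{N^*}<T\}} + \xi 1_{\{T_{N^*}\ge T\}} \,\Big|\, \mathcal{G}_{T_1}\Big] \\
&+ 1_{\{T_1\ge T\}} \Big(\int_t^{T} f_s\,ds + \xi\Big) \,\Big|\, \mathcal{F}_t\Big].
\end{aligned}
\]

Note that the second term on the RHS of the above equality is

\[ \operatorname*{ess\,sup}_{N\in\mathcal{N}'(T_1,\lambda)} \mathbb{E}\Big[\int_{T_1}^{T_N\wedge T} f_s\,ds + S_{T_N} 1_{\{T_N<T\}} + \xi 1_{\{T_N\ge T\}} \,\Big|\, \mathcal{G}_{T_1}\Big], \]

which by definition is $\hat y_{T_1}^\lambda$. The result then follows from the relationship (2.4). ■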

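Next, we make use of the density function $\lambda e^{-\lambda(x-t)}\,dx$ of the first Poisson arrival time $T_1$ to
simplify the dynamic programming equation (2.2):

\[
\begin{aligned}
\mathbb{E}\Big[\int_t^{T_1\wedge T} f_s\,ds \,\Big|\, \mathcal{F}_t\Big]
&= \mathbb{E}\Big[\int_t^{T} \lambda e^{-\lambda(x-t)} \int_t^{x} f_s\,ds\,dx + e^{-\lambda(T-t)} \int_t^{T} f_s\,ds \,\Big|\, \mathcal{F}_t\Big] \\
&= \mathbb{E}\Big[\int_t^{T} e^{-\lambda(s-t)} f_s\,ds \,\Big|\, \mathcal{F}_t\Big],
\end{aligned}
\]

where we used integration by parts in the second equality. Moreover,

\[
\mathbb{E}\big[\max\{S_{T_1}, y_{T_1}^\lambda\} 1_{\{t<T_1<T\}} + \xi 1_{\{T_1\ge T\}} \,\big|\, \mathcal{F}_t\big]
= \mathbb{E}\Big[\int_t^{T} \lambda e^{-\lambda(s-t)} \max\{S_s, y_s^\lambda\}\,ds + e^{-\lambda(T-t)} \xi \,\Big|\, \mathcal{F}_t\Big].
\]

Hence, (2.2) can be further simplified to

\[ y_t^\lambda = \mathbb{E}\Big[\int_t^{T} e^{-\lambda(s-t)} \big(f_s + \lambda\max\{S_s, y_s^\lambda\}\big)\,ds + e^{-\lambda(T-t)} \xi \,\Big|\, \mathcal{F}_t\Big]. \tag{2.5} \]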
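The first step of this simplification, averaging the running payoff over the exponential law of the first arrival time, can be checked by quadrature. The following is a minimal sketch assuming a deterministic driver (here $f_s = \cos s$ with antiderivative $\sin$, an arbitrary choice for illustration); it compares the expectation computed from the density $\lambda e^{-\lambda(x-t)}$ with the discounted integral obtained after integration by parts. The function names are hypothetical.

```python
import math

def trapezoid(g, a, b, n=20000):
    """Composite trapezoid rule for the integral of g over [a, b]."""
    h = (b - a) / n
    total = 0.5 * (g(a) + g(b))
    for k in range(1, n):
        total += g(a + k * h)
    return total * h

def expected_running_payoff(lam, antiderivative, t, T):
    """E[int_t^{min(T1, T)} f_s ds], using the density lam * exp(-lam * (x - t)) of T1."""
    F = lambda x: antiderivative(x) - antiderivative(t)     # int_t^x f_s ds
    stopped_early = trapezoid(lambda x: lam * math.exp(-lam * (x - t)) * F(x), t, T)
    no_arrival = math.exp(-lam * (T - t)) * F(T)            # event {T1 >= T}
    return stopped_early + no_arrival

def discounted_payoff(lam, f, t, T):
    """int_t^T exp(-lam * (s - t)) f_s ds, the form after integration by parts."""
    return trapezoid(lambda s: math.exp(-lam * (s - t)) * f(s), t, T)
```

For example, `expected_running_payoff(5.0, math.sin, 0.2, 1.0)` and `discounted_payoff(5.0, math.cos, 0.2, 1.0)` should agree up to quadrature error.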



Given the dynamic programming equation (2.5), we have the following representation result
for the value process $(y_t^\lambda)_{0\le t\le T}$, which proves the linear case of Theorem 1.2.

Lemma 2.2 Suppose that Assumption 1.1 holds, and that $f_s(y,z) = f_s$. Denote $(Y^\lambda,Z^\lambda)$ as the unique
solution to the following penalized BSDE:

\[ Y_t^\lambda = \xi + \int_t^T f_s\,ds + \int_t^T \lambda\max\{0, S_s - Y_s^\lambda\}\,ds - \int_t^T Z_s^\lambda\,dW_s. \]

Then $y_t^\lambda = Y_t^\lambda$ a.s. for $t\in[0,T]$.

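Proof. We introduce the dual equation for $Y^\lambda$:

\[ \alpha_t = 1 - \int_0^t \lambda\alpha_s\,ds \quad\text{for } t\in[0,T]. \]

By applying Itô's formula to $\alpha_t Y_t^\lambda$, we obtain

\[ \alpha_t Y_t^\lambda = \alpha_T Y_T^\lambda + \int_t^T \alpha_s\big(f_s + \lambda Y_s^\lambda + \lambda\max\{0, S_s - Y_s^\lambda\}\big)\,ds - \int_t^T \alpha_s Z_s^\lambda\,dW_s, \]

so that

\[
\begin{aligned}
Y_t^\lambda &= \frac{\alpha_T}{\alpha_t}\xi + \int_t^T \frac{\alpha_s}{\alpha_t}\big(f_s + \lambda\max\{S_s, Y_s^\lambda\}\big)\,ds - \int_t^T \frac{\alpha_s}{\alpha_t} Z_s^\lambda\,dW_s \\
&= \mathbb{E}\Big[e^{-\lambda(T-t)}\xi + \int_t^T e^{-\lambda(s-t)}\big(f_s + \lambda\max\{S_s, Y_s^\lambda\}\big)\,ds \,\Big|\, \mathcal{F}_t\Big],
\end{aligned}
\]

which is exactly the dynamic programming equation (2.5). ■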
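The equivalence between the penalized equation and the dynamic programming equation (2.5) can be checked numerically in a deterministic toy case. The following is a minimal sketch under assumptions that are not from the paper: constant driver $f_s = 0.5$, obstacle $S_t = \sin(\pi t)$ on $[0,1]$, $\xi = 0$ and $\lambda = 20$. It solves the penalized ODE $y' = -f - \lambda\max\{0, S - y\}$ by implicit Euler and then evaluates the right-hand side of (2.5) with the trapezoid rule. The function names are hypothetical.

```python
import math

def penalized(lam, f, S, xi, T=1.0, n=4000):
    """Implicit Euler for y' = -f - lam * max(0, S - y), y(T) = xi."""
    h = T / n
    y = [0.0] * (n + 1)
    y[n] = xi
    for k in range(n - 1, -1, -1):
        t = T * k / n
        a = y[k + 1] + h * f(t)                  # step without the penalty term
        if a >= S(t):
            y[k] = a
        else:
            y[k] = (a + h * lam * S(t)) / (1.0 + h * lam)
    return y

def dp_rhs(j, y, lam, f, S, xi, T=1.0):
    """Right-hand side of (2.5) at grid time t_j, by the trapezoid rule."""
    n = len(y) - 1
    h = T / n
    t = T * j / n

    def g(k):
        s = T * k / n
        return math.exp(-lam * (s - t)) * (f(s) + lam * max(S(s), y[k]))

    integral = sum(0.5 * h * (g(k) + g(k + 1)) for k in range(j, n))
    return integral + math.exp(-lam * (T - t)) * xi

def residual(j, lam=20.0):
    """|y_j - RHS of (2.5) at t_j| for the toy data described above."""
    f = lambda t: 0.5
    S = lambda t: math.sin(math.pi * t)
    y = penalized(lam, f, S, 0.0)
    return abs(y[j] - dp_rhs(j, y, lam, f, S, 0.0))
```

In this sketch `residual(0)` and `residual(2000)` measure how well the ODE solution satisfies (2.5) at times 0 and 0.5; both should be small, up to discretization error.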



2.2 Representation for Nonlinear Case 

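In this section, we extend the representation result of Lemma 2.2 to the nonlinear case, and give
the proof of Theorem 1.2.

Denote $(Y^\lambda,Z^\lambda)$ as the unique solution to penalized BSDE (1.3). Consider the optimal
stopping time problem (2.1):

\[ y_t^\lambda = \operatorname*{ess\,sup}_{N\in\mathcal{N}(t,\lambda)} \mathbb{E}\Big[\int_t^{T_N\wedge T} f_s(Y_s^\lambda,Z_s^\lambda)\,ds + S_{T_N} 1_{\{T_N<T\}} + \xi 1_{\{T_N\ge T\}} \,\Big|\, \mathcal{F}_t\Big]. \]

By Lemma 2.2, $y^\lambda$ admits the following BSDE representation:

\[ y_t^\lambda = \xi + \int_t^T f_s(Y_s^\lambda,Z_s^\lambda)\,ds + \int_t^T \lambda\max\{0, S_s - y_s^\lambda\}\,ds - \int_t^T z_s^\lambda\,dW_s. \]

On the other hand, $(Y^\lambda,Z^\lambda)$ satisfies penalized BSDE (1.3):

\[ Y_t^\lambda = \xi + \int_t^T f_s(Y_s^\lambda,Z_s^\lambda)\,ds + \int_t^T \lambda\max\{0, S_s - Y_s^\lambda\}\,ds - \int_t^T Z_s^\lambda\,dW_s. \]

Define

\[ \delta Y_t^\lambda = y_t^\lambda - Y_t^\lambda; \qquad \delta Z_t^\lambda = z_t^\lambda - Z_t^\lambda. \]

Then $(\delta Y^\lambda,\delta Z^\lambda)$ satisfies the following linear BSDE:

\[ \delta Y_t^\lambda = \int_t^T \lambda\beta_s\,\delta Y_s^\lambda\,ds - \int_t^T \delta Z_s^\lambda\,dW_s \tag{2.6} \]

with

\[ \beta_s = \frac{\max\{0, S_s - y_s^\lambda\} - \max\{0, S_s - Y_s^\lambda\}}{y_s^\lambda - Y_s^\lambda}\, 1_{\{y_s^\lambda \ne Y_s^\lambda\}}. \]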

Obviously, $|\beta_s| \le 1$, so BSDE (2.6) admits a unique solution (see for example [7] for the proof).
On the other hand, $\delta Y_t^\lambda = \delta Z_t^\lambda = 0$ is an obvious solution to BSDE (2.6). Therefore, we conclude
that $y_t^\lambda = Y_t^\lambda$ a.s. for $t\in[0,T]$, which proves Theorem 1.2.



3 Optimal Control Representation 

Krylov in [12] showed that optimal stopping for controlled diffusion processes can always be
transformed to optimal control by using randomized stopping. See also Gyongy and Siska [8] for
recent developments. In this section, our aim is to give optimal control interpretations of both
reflected BSDE (1.1) and penalized BSDE (1.3).
Let us first recall the basic idea of Krylov's randomized stopping. For simplicity, we only
consider the linear case $f_s(y,z) = f_s$. For any fixed time $t\in[0,T]$, consider a nonnegative control
process $(r_s)_{s\ge t}$. Let the payoff functional $\int_t^s f_u\,du + S_s$ stop with intensity $r_s\Delta$ in an infinitesimal
interval $(s, s+\Delta)$. Then the probability that stopping does not occur before time $s$ is

\[ e^{-\int_t^s r_u\,du}. \]

The probability that stopping does not occur before time $s$ and does occur in an infinitesimal
interval $(s, s+\Delta)$ is

\[ e^{-\int_t^s r_u\,du}\, r_s\Delta. \]

Therefore, the payoff functional associated with the control process $r$ from $[t,T]$ is given by

\[ \mathbb{E}\Big[\int_t^T \Big(\int_t^s f_u\,du + S_s\Big) r_s e^{-\int_t^s r_u\,du}\,ds + \Big(\int_t^T f_s\,ds + \xi\Big) e^{-\int_t^T r_u\,du} \,\Big|\, \mathcal{F}_t\Big], \]

where the first term is the payoff if stopping does occur before time $T$, and the second term
corresponds to the payoff if stopping does not occur in the time interval $[t,T]$. By applying
integration by parts, the payoff functional is further simplified to

\[ \mathbb{E}\Big[\int_t^T \big(f_s + r_s S_s\big) e^{-\int_t^s r_u\,du}\,ds + \xi e^{-\int_t^T r_u\,du} \,\Big|\, \mathcal{F}_t\Big]. \]
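The integration by parts step in the randomized stopping argument can be checked numerically for deterministic data. The following is a minimal sketch, assuming deterministic $f$, $S$ and stopping intensity $r$ chosen arbitrarily for illustration and starting time $t = 0$; it evaluates the payoff functional in its two forms (before and after integration by parts) with cumulative trapezoid sums. The function names are hypothetical.

```python
import math

def cumulative_trapezoid(values, h):
    """Running trapezoid integral: out[k] approximates the integral over [0, k*h]."""
    out = [0.0]
    for k in range(1, len(values)):
        out.append(out[-1] + 0.5 * h * (values[k - 1] + values[k]))
    return out

def payoff_two_ways(f, S, r, xi, T=1.0, n=20000):
    """Return the randomized-stopping payoff before and after integration by parts."""
    h = T / n
    grid = [T * k / n for k in range(n + 1)]
    F = cumulative_trapezoid([f(s) for s in grid], h)     # int_0^s f_u du
    R = cumulative_trapezoid([r(s) for s in grid], h)     # int_0^s r_u du
    survive = [math.exp(-x) for x in R]                    # P(no stop before s)
    # Form 1: stop in (s, s + ds) with probability r_s * survive_s * ds, else run to T.
    g1 = [(F[k] + S(grid[k])) * r(grid[k]) * survive[k] for k in range(n + 1)]
    form1 = cumulative_trapezoid(g1, h)[-1] + (F[-1] + xi) * survive[-1]
    # Form 2: discounted running payoff f_s + r_s * S_s plus discounted terminal data.
    g2 = [(f(grid[k]) + r(grid[k]) * S(grid[k])) * survive[k] for k in range(n + 1)]
    form2 = cumulative_trapezoid(g2, h)[-1] + xi * survive[-1]
    return form1, form2
```

For instance, `payoff_two_ways(math.cos, lambda s: s, lambda s: 2.0 + math.sin(s), 1.0)` returns two numbers that should coincide up to quadrature error.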

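We have the following optimal control representation for penalized BSDE (1.3).

Proposition 3.1 Suppose that Assumption 1.1 holds. Denote $(Y^\lambda,Z^\lambda)$ as the unique solution to penalized
BSDE (1.3). For any fixed time $t\in[0,T]$, define the control set $\mathcal{A}(t,\lambda)$ as

\[ \mathcal{A}(t,\lambda) = \{\mathcal{F}_s\text{-adapted process } (r_s)_{s\ge t} : r_s = 0 \text{ or } \lambda\}. \]

Then the value process of the following optimal control problem:

\[ y_t^\lambda = \sup_{r\in\mathcal{A}(t,\lambda)} \mathbb{E}\Big[\int_t^T \big(f_s(Y_s^\lambda,Z_s^\lambda) + r_s S_s\big) e^{-\int_t^s r_u\,du}\,ds + e^{-\int_t^T r_u\,du}\,\xi \,\Big|\, \mathcal{F}_t\Big] \tag{3.1} \]

is given by the solution to penalized BSDE (1.3): $y_t^\lambda = Y_t^\lambda$ a.s. for $t\in[0,T]$. The optimal control is given
by $r_s^* = \lambda 1_{\{S_s \ge Y_s^\lambda\}}$.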
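Proof. We only consider the linear case $f_s(y,z) = f_s$, as the proof for the nonlinear case $f_s(y,z)$
is the same as the proof of Theorem 1.2.

First, analogous to Lemma 2.2, it is easy to show that the expected payoff process associated
with any fixed control $r\in\mathcal{A}(t,\lambda)$,

\[ y_t^\lambda(r) = \mathbb{E}\Big[\int_t^T \big(f_s + r_s S_s\big) e^{-\int_t^s r_u\,du}\,ds + e^{-\int_t^T r_u\,du}\,\xi \,\Big|\, \mathcal{F}_t\Big], \]

is the unique solution to the following BSDE

\[ y_t^\lambda(r) = \xi + \int_t^T \big(f_s + r_s(S_s - y_s^\lambda(r))\big)\,ds - \int_t^T z_s^\lambda(r)\,dW_s. \]

Since the control $r$ only appears in the driver, by the BSDE comparison theorem (see for example
[7]), $y_t^\lambda = \sup_{r\in\mathcal{A}(t,\lambda)} y_t^\lambda(r)$ satisfies the following BSDE:

\[ y_t^\lambda = \xi + \int_t^T \Big(f_s + \sup_{r_s\in\{0,\lambda\}} r_s(S_s - y_s^\lambda)\Big)\,ds - \int_t^T z_s^\lambda\,dW_s. \]

The optimal control is $r_s^* = \lambda 1_{\{S_s \ge y_s^\lambda\}}$, and with such an optimal control $r^*$, the driver becomes
$f_s + \lambda\max\{0, S_s - y_s^\lambda\}$, which is the driver of penalized BSDE (1.3). By the uniqueness, $y_t^\lambda = Y_t^\lambda$
a.s. for $t\in[0,T]$. ■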
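The role of the bang-bang control can be seen in a deterministic toy case. The following is a minimal sketch under assumptions that are not from the paper: constant driver $f_s = 0.5$, obstacle $S_t = \sin(\pi t)$ on $[0,1]$, $\xi = 0$, $\lambda = 20$, and an implicit Euler discretization. For a fixed control on the grid, the payoff solves the linear ODE $y' = -(f + r(S - y))$; the sketch compares the control $\lambda 1_{\{S \ge y^\lambda\}}$ with the two constant controls $0$ and $\lambda$. The function names are hypothetical.

```python
import math

def penalized(lam, f, S, xi, T=1.0, n=2000):
    """Implicit Euler for y' = -f - lam * max(0, S - y), y(T) = xi."""
    h = T / n
    y = [0.0] * (n + 1)
    y[n] = xi
    for k in range(n - 1, -1, -1):
        t = T * k / n
        a = y[k + 1] + h * f(t)
        y[k] = a if a >= S(t) else (a + h * lam * S(t)) / (1.0 + h * lam)
    return y

def controlled(r_grid, f, S, xi, T=1.0):
    """Implicit Euler for y' = -(f + r * (S - y)), y(T) = xi, with r given on the grid."""
    n = len(r_grid) - 1
    h = T / n
    y = [0.0] * (n + 1)
    y[n] = xi
    for k in range(n - 1, -1, -1):
        t = T * k / n
        a = y[k + 1] + h * f(t)
        y[k] = (a + h * r_grid[k] * S(t)) / (1.0 + h * r_grid[k])
    return y

def bang_bang(lam, y, S, T=1.0):
    """Grid version of the control lam * 1{S >= y}."""
    n = len(y) - 1
    return [lam if S(T * k / n) >= y[k] else 0.0 for k in range(n + 1)]

def compare(lam=20.0, n=2000):
    """Values at time 0: penalized, bang-bang control, r = 0, r = lam."""
    f = lambda t: 0.5
    S = lambda t: math.sin(math.pi * t)
    y = penalized(lam, f, S, 0.0, n=n)
    y_star = controlled(bang_bang(lam, y, S), f, S, 0.0)
    y_zero = controlled([0.0] * (n + 1), f, S, 0.0)
    y_full = controlled([lam] * (n + 1), f, S, 0.0)
    return y[0], y_star[0], y_zero[0], y_full[0]
```

In this discretization the bang-bang control reproduces the penalized value, while the two constant controls give values that are no larger.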

The optimal control representation for reflected BSDE (1.1) is the same as (3.1), except that the
control set is changed to $\mathcal{A}(t) = \bigcup_{\lambda}\mathcal{A}(t,\lambda)$. The value process of the following optimal control
problem:

\[ y_t = \sup_{r\in\mathcal{A}(t)} \mathbb{E}\Big[\int_t^T \big(f_s(Y_s,Z_s) + r_s S_s\big) e^{-\int_t^s r_u\,du}\,ds + e^{-\int_t^T r_u\,du}\,\xi \,\Big|\, \mathcal{F}_t\Big] \tag{3.2} \]

is given by the solution to reflected BSDE (1.1): $y_t = Y_t$ a.s. for $t\in[0,T]$.



4 Optimal Switching Representation for Multidimensional Penalized BSDE

Multidimensional reflected BSDE was first introduced by Hamadene and Jeanblanc [9],
where they used its solution to characterize the value process of an optimal switching problem,
in particular in the setting of power plant management. The related equations were solved
by Hu and Tang [11] using the penalty method, and by Hamadene and Zhang [10] using the iterated
optimal stopping time method. See also Chassagneux et al [2] for recent developments. A
multidimensional reflected BSDE is a $d$-dimensional system, where each component $1\le i\le d$
represents the region $i$:

\[ Y_t^i = \xi^i + \int_t^T f_s^i(Y_s^i,Z_s^i)\,ds + \int_t^T dK_s^i - \int_t^T Z_s^i\,dW_s \tag{4.1} \]

under the constraints:

(Dominating Condition): $Y_t^i \ge \mathcal{M}Y_t^i$ for $t\in[0,T]$,

(Skorohod Condition): $\int_0^T (Y_t^i - \mathcal{M}Y_t^i)\,dK_t^i = 0$ for $K^i$ continuous and increasing,

where

\[ \mathcal{M}Y_t^i = \max_{j\ne i}\{Y_t^j - C_t^{i,j}\} \]

represents the payoff of switching to region $j$ from region $i$. The terminal data $\xi^i$, the driver
$f_s^i(y^i,z^i)$ and the switching cost $(C_s^{i,j})_{0\le s\le T}$ are the given data. Different from the one-dimensional
reflected BSDE, whose solution must stay above an obstacle process, the solution of the multidimensional
reflected BSDE (4.1) evolves in the random closed convex set:

\[ \Big\{y\in\mathbb{R}^d : y^i \ge \max_{j\ne i}\{y^j - C_t^{i,j}\}\Big\}. \]

The following standard assumption on the data set $(\xi^i, f^i, C^{i,j})$ is imposed in Hu and Tang [11]:

Assumption 4.1 • The terminal data $\xi^i$ and the driver $f_s^i(y^i,z^i)$ satisfy Assumption 1.1.

• The switching cost $(C^{i,j})_{1\le i,j\le d}$ satisfies (i) $C_t^{i,i} = 0$; (ii) $\inf_{t\in[0,T]} C_t^{i,j} \ge C > 0$ for $i\ne j$; and (iii)
$\inf_{t\in[0,T]} \big(C_t^{i,j} + C_t^{j,l} - C_t^{i,l}\big) \ge C > 0$ for $i\ne j\ne l$.

The condition on the driver $f_s^i(y^i,z^i)$ can be relaxed. For example, in Hamadene and Zhang [10]
and Chassagneux et al [2], the driver is even allowed to be coupled in $y$, i.e. having the form
$f_s^i(y,z^i)$ with $y = (y^1,\dots,y^d)$.

Under Assumption 4.1, Hu and Tang [11] proved that the solution to the multidimensional
reflected BSDE (4.1) corresponds to the value process of an optimal switching problem. Indeed,
introduce the control set $\mathcal{K}^i(t)$ as

\[ \mathcal{K}^i(t) = \Big\{\mathcal{F}_s\text{-adapted process } (u_s)_{s\ge t} : u_s = i 1_{\{t\}}(s) + \sum_{k\ge 0} \alpha_k 1_{(\tau_k,\tau_{k+1}]}(s)\Big\}, \]

where

• $(\tau_k)_{k\ge 0}$ is an increasing sequence of $\mathcal{F}_s$-stopping times valued in $[t,T]$ with $\tau_0 = t$ and
$\tau_N = T$ for some integer-valued random variable $N$.

• $(\alpha_k)_{k\ge 0}$ is a sequence of random variables valued in $\{1,\dots,d\}$ such that $\alpha_k$ is $\mathcal{F}_{\tau_k}$-measurable,
and $\alpha_0 = i$.

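Then the value process of the following optimal switching problem:

\[ y_t^i = \sup_{u\in\mathcal{K}^i(t)} \mathbb{E}\Big[\int_t^T f_s^{u_s}(Y_s^{u_s},Z_s^{u_s})\,ds + \xi^{u_T} - \sum_{k\ge 1} C_{\tau_k}^{\alpha_{k-1},\alpha_k} 1_{\{t<\tau_k<T\}} \,\Big|\, \mathcal{F}_t\Big] \tag{4.2} \]

is given by the solution to the multidimensional reflected BSDE (4.1): $y_t^i = Y_t^i$ a.s. for $t\in[0,T]$.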

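On the other hand, Hu and Tang [11] introduced the following multidimensional penalized
BSDE to approximate and solve the multidimensional reflected BSDE (4.1):

\[ Y_t^{i,\lambda} = \xi^i + \int_t^T f_s^i(Y_s^{i,\lambda},Z_s^{i,\lambda})\,ds + \int_t^T \lambda\max\{0, \mathcal{M}Y_s^{i,\lambda} - Y_s^{i,\lambda}\}\,ds - \int_t^T Z_s^{i,\lambda}\,dW_s, \tag{4.3} \]

and they proved that under Assumption 4.1, $Y^{i,\lambda}$ is increasing in $\lambda$, and

\[ \lim_{\lambda\uparrow\infty}\mathbb{E}\Big[\sup_{t\in[0,T]} |Y_t^{i,\lambda} - Y_t^i|^2 + \int_0^T |Z_t^{i,\lambda} - Z_t^i|^2\,dt + \sup_{t\in[0,T]} |K_t^{i,\lambda} - K_t^i|^2\Big] = 0. \]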



Our aim in this section is to give a stochastic control interpretation of the multidimensional
penalized BSDE (4.3).
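To make the structure of the penalized system (4.3) concrete, the following is a minimal sketch in a deterministic two-regime toy case; the data are assumptions made for illustration and are not from the paper. Regime 0 pays at rate 2 on $[0, 0.5)$ and nothing afterwards, regime 1 pays nothing on $[0, 0.5)$ and at rate 2 afterwards, the switching cost is the constant 0.1, and $\xi^i = 0$. The system then reduces to two coupled ODEs $y_i' = -f_i - \lambda\max\{0, y_j - C - y_i\}$, solved here by an explicit backward Euler scheme, which is only used for $\lambda h \le 1$. Without the Poisson constraint, the best strategy from regime 0 earns $1 + 1 - 0.1 = 1.9$.

```python
def solve_switching_penalized(lam, f, cost, xi, T=1.0, n=1000):
    """Explicit backward Euler for the two-regime penalized system.

    f(i, t) is the running payoff in regime i at time t; xi is the pair of
    terminal values. Returns the values at time 0 for regimes 0 and 1.
    """
    h = T / n
    if lam * h > 1.0:
        raise ValueError("explicit scheme is only used here for lam * h <= 1")
    y = [xi[0], xi[1]]
    for k in range(n - 1, -1, -1):
        t = T * k / n
        new = []
        for i in (0, 1):
            j = 1 - i
            switch_gain = max(0.0, y[j] - cost - y[i])     # max{0, M y^i - y^i}
            new.append(y[i] + h * f(i, t) + h * lam * switch_gain)
        y = new
    return y

def toy_payoff(i, t):
    """Regime 0 pays before time 0.5, regime 1 pays from time 0.5 on."""
    if i == 0:
        return 2.0 if t < 0.5 else 0.0
    return 0.0 if t < 0.5 else 2.0

def toy_values(lam):
    return solve_switching_penalized(lam, toy_payoff, 0.1, (0.0, 0.0))
```

In this toy case `toy_values(lam)[0]` increases with `lam` and approaches 1.9 from below, which mirrors the monotone convergence of the penalized system to the reflected one.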



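Proposition 4.2 Suppose that Assumption 4.1 holds. Denote $(Y^{i,\lambda},Z^{i,\lambda})$ as the unique solution to the
multidimensional penalized BSDE (4.3). For any fixed time $t\in[0,T]$, define the control set $\mathcal{K}^i(t,\lambda)$ as

\[ \mathcal{K}^i(t,\lambda) = \{u\in\mathcal{K}^i(t) : u \text{ is } \mathcal{G}_s\text{-adapted, and } \tau_k = T_k\wedge T \text{ for } k\ge 0\}, \]

where $(T_n)_{n\ge 0}$ are the arrival times of a Poisson process $(N_s)_{s\ge t}$ with intensity $\lambda$ and minimal augmented
filtration $\{\mathcal{H}_s\}_{s\ge t}$, and $\mathcal{G}_s = \mathcal{F}_s\vee\mathcal{H}_s$. Then the value process $y^{i,\lambda}$ of the following optimal switching
problem:

\[ y_t^{i,\lambda} = \sup_{u\in\mathcal{K}^i(t,\lambda)} \mathbb{E}\Big[\int_t^T f_s^{u_s}(Y_s^{u_s,\lambda},Z_s^{u_s,\lambda})\,ds + \xi^{u_T} - \sum_{k\ge 1} C_{\tau_k}^{\alpha_{k-1},\alpha_k} 1_{\{t<\tau_k<T\}} \,\Big|\, \mathcal{F}_t\Big] \tag{4.4} \]

is given by the solution to (4.3): $y_t^{i,\lambda} = Y_t^{i,\lambda}$ a.s. for $t\in[0,T]$.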
Proof. The proof is similar to that of Theorem 1.2, so we only sketch it. The first step is to
consider the case where the driver $f_s^i(y^i,z^i)$ is independent of $(y^i,z^i)$, which is then denoted as $f_s^i$.
The key step is to prove the following dynamic programming equation for the value process $y^{i,\lambda}$:

\[ y_t^{i,\lambda} = \mathbb{E}\Big[\int_t^{\tau_1} f_s^i\,ds + \max\{\mathcal{M}y_{\tau_1}^{i,\lambda}, y_{\tau_1}^{i,\lambda}\} 1_{\{t<\tau_1<T\}} + \xi^i 1_{\{\tau_1 = T\}} \,\Big|\, \mathcal{F}_t\Big], \tag{4.5} \]

which is in the same spirit as the dynamic programming equation (2.2) in Lemma 2.1. Indeed,
introduce an auxiliary optimal switching problem:

rT 



iA 



sup E 



■It )C>1 



Ht<Tk<T}\^t 



(4.6) 



where the control set $\hat{\mathcal{K}}^i(t,\lambda)$ is the same as $\mathcal{K}^i(t,\lambda)$, except that we do not require $\alpha_0 = i$. In
other words, we are allowed to switch at the starting time $t$. Due to this additional switching
opportunity, we have the following relationship between the two optimal switching problems:



\[ \hat y_t^{i,\lambda} = \max\{\mathcal{M}y_t^{i,\lambda}, y_t^{i,\lambda}\}. \tag{4.7} \]



Moreover, if $u_s^* = i 1_{\{t\}}(s) + \sum_{k\ge 0} \alpha_k^* 1_{(\tau_k,\tau_{k+1}]}(s)$ for $s\ge t$ is the optimal switching strategy for
(4.4) starting from time $t$, then by time consistency and (4.7), $u_s^*$ for $s\ge\tau_1$ is also the optimal switching
strategy for (4.6) starting from the first possible switching time $\tau_1$.

If $u_s^*$ for $s\ge t$ is the optimal switching strategy for (4.4), by conditioning on the first possible
switching time $\tau_1$, we have



k>l 



l{t<Ti<r,Ti<T;<r} + l{f<Tl<T,T;=T} +1{ti=T} 



/^*ds + r^-Ec:f'S<x;<T})l-^. 



k>l 



l{t<Ti<r}E 



+ l{f<Ti<T} 

+1{T,=T}E 



4ti<t*<T} 



/ids- 



IJ't 



k>l 



fids- 



flds + ^'\\T,^ 



fids + ^'\Fr, 



{f<Ti<T}^l + i{t<Ti<T}^2 i{Ti=T}J3 
where we used the fact that on {tj = T}, we must have {t^ = T}. Moreover,
/i =P(ti <Ti* < TI^^J 
and 



By plugging l\ and li into the expression for , we obtain 



l{f<Ti<T} / /s'rfs 



+ l{t<Ti<r}E 



\ 1 *:>! / 



Note that the second term of the above expression is 



sup E 

^g£'(ti,A) 



which by definition is $\hat y_{\tau_1}^{i,\lambda}$. The result then follows from the relationship (4.7).

The rest of the proof is the same as the proof of the optimal stopping time representation
of the one-dimensional penalized BSDE in Theorem 1.2. ■

5 Conclusion 

In this paper, we find the stochastic control representations of (multidimensional) reflected
BSDE and penalized BSDE, which are summarized in the following table.

Table 1: Stochastic Control Representations of Reflected BSDE and Penalized BSDE

                                   Optimal stopping representation                  Optimal control representation
Reflected BSDE                     (1.2) with $\tau\in\mathcal{R}(t)$                (3.2) with $r\in\mathcal{A}(t)$
Penalized BSDE                     (1.5) with $\tau\in\mathcal{R}(t,\lambda)$        (3.1) with $r\in\mathcal{A}(t,\lambda)$
Multidimensional Reflected BSDE    (4.2) with $u\in\mathcal{K}^i(t)$ (optimal switching)
Multidimensional Penalized BSDE    (4.4) with $u\in\mathcal{K}^i(t,\lambda)$ (optimal switching)
The main feature of the related optimal stopping (switching) representations is that
stopping is allowed only at the arrival times of an exogenous Poisson process. This kind of optimal
stopping with random intervention times was first considered by Dupuis and Wang [4], where
they used it to model perpetual American options. Since their problem is one-dimensional and
has an infinite time horizon, they do not need to introduce a penalized equation. Instead, they
worked out two ordinary differential equations (ODE for short), defined in the continuation region and
the stopping region respectively. Recently, Liang et al [15] found that this kind of optimal stopping
with random intervention times can be used to model bank run problems. In the setting of
American options, Dai et al [3] showed intuitively that the penalty method is closely related to the
intensity framework. However, they did not introduce any stochastic control interpretation for their
penalty method.

Finally, it seems that the only existing representation result for penalized BSDE was given by
Lepeltier and Xu [13] and [14], where they found a connection between penalized BSDE and a
standard optimal stopping problem with the modified obstacle $\min\{S_t, Y_t^\lambda\}$. Our representation results
are different, and seem more natural: penalized BSDE is nothing but a random time discretization
of an optimal stopping time problem, where the time is discretized by Poisson arrival times.